Phrasier: An Interactive System for Linking and Browsing Within Document Collections Using Keyphrases
نویسنده
چکیده
When documents are collected together from diverse sources they are unlikely to contain useful hypertext links to support browsing amongst them. Manual, or semi-automated link creation is often infeasibly time-consuming for large document collections. We present Phrasier, an interactive system which automatically introduces links to related material into documents as the user browses and queries a digital library collection. Suitable links are identified using keyphrases that are identified within document text and support both topicbased and inter-document navigation. Previews of link destinations are provided to reduce unproductive link traversals, and important segments of document text are identified and highlighted to support skimming of viewed documents. Evaluation has shown that PhrasierÕs keyphrase-based linking mechanism produces sparse hypertexts, although similar documents tend to have short paths between them. A study using human assessors in a simulated document retrieval task indicated that the generated links are perceived to be useful and relevant.
منابع مشابه
Design and Evaluation of Phrasier, an Interactive System for Linking Documents Using Keyphrases
When documents are collected together from diverse sources they are unlikely to contain useful hypertext links to support browsing amongst them. Manual, or semi-automated link creation is often infeasibly time-consuming for large document collections. We present Phrasier, an interactive system which automatically introduces links to related material into documents as the user browses and querie...
متن کاملLink as You Type: Using Key Phrases for Automated Dynamic Link Generation
When documents are collected together from diverse sources they are unlikely to contain useful hypertext links to support browsing amongst them. For large collections of thousands of documents it is prohibitively resource intensive to manually insert links into each document. Users of such collections may wish to relate documents within them to text that they are themselves generating. This pro...
متن کاملFinding nuggets in documents: A machine learning approach
However, many text mining applications do not have adequate natural language processing ability beyond simple keyword indexing, and as a result, there are too many textual elements (words) included in the analysis. We argue that noun phrases as textual elements are better suited for text mining and could provide more discriminating power, than single words. Discourse representation theory (Kamp...
متن کاملInteractive Demo: Stay in Touch with InfoVis – Visualizing Document Collections with Document Cards
Large document collections are essential resources for a wide variety of professionals, like scientists, lawyers, analysts, etc. An electronic document management system can assist them in solving the tedious tasks of curating, browsing, searching, and recognizing documents in these collections. As an initial step in creating such a system, we invented the Document Cards [3] as a mixed image-te...
متن کاملCross-language Entity Linking Adapting to User’s Language Ability
In this paper, we propose a method to automatically discover valuable keyphrases in Japanese and link these keyphrases to related Chinese Wikipedia pages. The method that we propose has four stages. Firstly, we extract nouns from a Japanese document using a morphological analyzer and extract the candidates of keyphrases using a method called Top Consecutive Nouns Cohesion (TCNC) [1]. Then, we j...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999